Plant Phenomics

Research Article | Open Access

Volume 2023 |Article ID 0084 | https://doi.org/10.34133/plantphenomics.0084

Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models

Amogh Joshi,^1,2,3 Dario Guevara,^1,2,3 Mason Earles ^1,2,3

¹Department of Viticulture and Enology, University of California, Davis, Davis, CA, USA
²Department of Biological and Agricultural Engineering, University of California, Davis, Davis, CA, USA
³AI Institute for Next-Generation Food Systems (AIFS), University of California, Davis, Davis, CA, USA

Received 23 Jan 2023	Accepted 02 Aug 2023	Published 06 Sep 2023

Abstract

In recent years, deep learning models have become the standard for agricultural computer vision. Such models are typically fine-tuned to agricultural tasks using model weights that were originally fit to more general, non-agricultural datasets. This lack of agriculture-specific fine-tuning potentially increases training time and resource use, and decreases model performance, leading to an overall decrease in data efficiency. To overcome this limitation, we collect a wide range of existing public datasets for 3 distinct tasks, standardize them, and construct standard training and evaluation pipelines, providing us with a set of benchmarks and pretrained models. We then conduct a number of experiments using methods that are commonly used in deep learning tasks but unexplored in their domain-specific applications for agriculture. Our experiments guide us in developing a number of approaches to improve data efficiency when training agricultural deep learning models, without large-scale modifications to existing pipelines. Our results demonstrate that even slight training modifications, such as using agricultural pretrained model weights, or adopting specific spatial augmentations into data processing pipelines, can considerably boost model performance and result in shorter convergence time, saving training resources. Furthermore, we find that even models trained on low-quality annotations can produce comparable levels of performance to their high-quality equivalents, suggesting that datasets with poor annotations can still be used for training, expanding the pool of currently available datasets. Our methods are broadly applicable throughout agricultural deep learning and present high potential for substantial data efficiency improvements.

Research Article | Open Access

Volume 2023 |Article ID 0084 | https://doi.org/10.34133/plantphenomics.0084

Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models

Amogh Joshi,^1,2,3 Dario Guevara,^1,2,3 Mason Earles ^1,2,3

Abstract

Fulltext

PDF

Download Citation

Submit Manuscript

Research Article | Open Access

Volume 2023 |Article ID 0084 | https://doi.org/10.34133/plantphenomics.0084

Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models

Amogh Joshi,1,2,3 Dario Guevara,1,2,3 Mason Earles 1,2,3

Abstract

Fulltext

PDF

Download Citation

Submit Manuscript

Amogh Joshi,^1,2,3 Dario Guevara,^1,2,3 Mason Earles ^1,2,3